Cross-lingual acoustic model adaptation based on transfer vector field smoothing with MAP
نویسندگان
چکیده
We propose a method to adapt acoustic models for robust speech recognition in real environments using data from other languages. In real-world speech recognition systems, we can effectively adapt acoustic models using the speech data logged by the system. However, when developing a system for a new language, this step is impossible since we have no such speech data for it. Assuming that similar Gaussians of each language have similar transfer vectors, in our proposed method, we estimate the transfer vectors of each Gaussian of the language for acoustic model adaptation by the transfer vectors of the other language. We evaluated the performance of Indonesian acoustic models that were adapted using the transfer vectors estimated from Japanese transfer vectors. Our proposed method achieved a relative error reduction rate of 10.6% for real environmental speech data. Index Terms acoustic model adaptation, cross-lingual, MAPVFS
منابع مشابه
Differential Approach to Acoustic Model Adaptation
This paper discusses a ‘differential approach’ to acoustic model adaptation to different channel, noise and speaker conditions for speech recognition with a concept of vector field adaptation. Adaptation of acoustic model, such as HMM output probability densities, is modeled as a vector field in the acoustic feature vector space which effects on the models to move along the local vector directi...
متن کاملAnalytic Methods for Acoustic Model Adaptation: A Review
This paper discusses analytic methods of acoustic model adaptation for automatic speech recognition and reviews other major methods. The main purpose of this paper is to demonstrate the potential of analytic approach for model adaptation. As an example of analytic methods, Jacobian Adaptation (JA) is intensively discussed and its potential of applicability to speech recognition problems is reve...
متن کاملCross-lingual acoustic modeling for dialectal Arabic speech recognition
Amajor problem with dialectal Arabic acoustic modeling is due to the very sparse available speech resources. In this paper, we have chosen Egyptian Colloquial Arabic (ECA) as a typical dialect. In order to benefit from existing Modern Standard Arabic (MSA) resources, a cross-lingual acoustic modeling approach is proposed that is based on supervised model adaptation. MSA acoustic models were ada...
متن کاملAcoustic model adaptation based on coarse/fine training of transfer vectors and its application to a speaker adaptation task
In this paper, we propose a novel adaptation technique based on coarse/fine training of transfer vectors. We focus on transfer vector estimation of a Gaussian mean from an initial model to an adapted model. The transfer vector is decomposed into a direction vector and a scaling factor. By using tied-Gaussian class (coarse class) estimation for the direction vector, and by using individual Gauss...
متن کاملMultilingual Training and Cross-lingual Adaptation on CTC-based Acoustic Model
Phoneme-based multilingual training and different crosslingual adaptation techniques for Automatic Speech Recognition (ASR) are explored in Connectionist Temporal Classification (CTC)-based systems. The multilingual model is trained to model a universal IPA-based phone set using CTC loss function. While the same IPA symbol may not correspond to acoustic similarity, Learning Hidden Unit Contribu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013